Differentiating Homonymy and Polysemy in Information Retrieval

نویسنده

  • Christopher Stokoe
چکیده

Recent studies into Web retrieval have shown that word sense disambiguation can increase retrieval effectiveness. However, it remains unclear as to the minimum disambiguation accuracy required and the granularity with which one must define word sense in order to maximize these benefits. This study answers these questions using a simulation of the effects of ambiguity on information retrieval. It goes beyond previous studies by differentiating between homonymy and polysemy. Results show that retrieval is more sensitive to polysemy than homonymy and that, when resolving polysemy, accuracy as low as 55% can potentially lead to increased performance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Homonymy and Polysemy in Information Retrieval

This paper discusses research on distinguishing word meanings in the context of information retrieval systems. We conducted experiments with three sources of evidence for making these distinctions: morphology, part-of-speech, and phrases. We have focused on the distinction between homonymy and polysemy (unrelated vs. related meanings). Our results support the need to distinguish homonymy and p ...

متن کامل

Ontology-based Distinction between Polysemy and Homonymy

We consider the problem of distinguishing polysemous from homonymous nouns. This distinction is often taken for granted, but is seldom operationalized in the shape of an empirical model. We present a first step towards such a model, based on WordNet augmented with ontological classes provided by CoreLex. This model provides a polysemy index for each noun which (a), accurately distinguishes betw...

متن کامل

Homonymy and Polysemy in the Czech Morphological Dictionary

We focus on a problem of homonymy and polysemy in morphological dictionaries on the example of the Czech morphological dictionary MorfFlex CZ [2]. It is not necessary to distinguish meanings in morphological dictionaries unless the distinction has consequencies in word formation or syntax. The contribution proposes several important rules and principles for achieving consistency.

متن کامل

بررسی مشکلات جستوجو و بازیابی اطلاعات در پایگاههای اطلاعاتی از جنبه ویژگیهای نگارشی زبان فارسی

The present research was carried out with the aim of explicating the major writing and semantic problems of Persian language when using data environments and determining the degree of compatibility and attention to these features in Persian databases. This research is of survey analytical type being conducted through direct observation. Having reviewed the related literature, we kept a checkli...

متن کامل

Lexical ambiguity in L2: Homonymy and polysemy among Polish-English bilinguals

The M.A. study reported here aimed at examining the processing of isolated lexically ambiguous words in L2 by fluent Polish-English bilinguals. Just as many people speak more then one language (Cook 2002: 22), most words prove to have more than one meaning or sense (Rodd er al. 2004: 90). Consequently, it appears crucial to add the ambiguity factor to the analysis of the lexicon. Indeed, a vari...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005